Parallel Svd Computation in Updating Problems of Latent Semantic Indexing ∗
نویسندگان
چکیده
In latent semantic indexing, the addition of documents (or the addition of terms) to some already processed text collection leads to the updating of the best rank-k approximation of the term-document matrix. The computationally most intensive task in this updating is the computation of the singular value decomposition (SVD) of certain square matrix, which is upper or lower triangular, and contains a diagonal block in its upper left corner. For the solution of this task, the new dynamic ordering of subproblems is compared with the up-to-now preferred static cyclic one in the parallel two-sided block-Jacobi SVD algorithm. The results of numerical experiments show that, for a given accuracy, the dynamic ordering is much more efficient that the static cyclic one with respect to the number of parallel iteration steps needed for the convergence of the two-sided block-Jacobi SVD algorithm.
منابع مشابه
Matrices with Low-Rank-Plus-Shift Structure: Partial SVD and Latent Semantic Indexing
We present a detailed analysis of matrices satisfying the so-called low-mnk-plus-shift property in connection with the computation of their partial singular value decomposition. The application we have in mind is Latent Semantic Indexing for information retrieval where the termdocument matrices generated from a text corpus approximately satisfy this property. The analysis is motivated by develo...
متن کاملParallel Methods in Updating/Downdating Problems of the Latent Semantic Indexing
In the case of large databases, which are encoded on some sort of a parallel computer (e.g., a supercomputer or a cluster of personal computers), it is frequently needed to update or downdate either documents or terms. This task can be done in parallel based on the theory of the Singular Value Decomposition (SVD) of a Term-Document Matrix (TDM). However, the TDM is usually not explicitly stored...
متن کاملA Novel Study for Summary/attribute Based Bug Tracking Classification Using Latent Semantic Indexing and Svd in Data Mining
This paper presentsa Latent Semantic Indexing (LSI) method for learningBug tracking concepts in document data. Each attribute in a vector provides the mark of participation of the document in data or term in the parallel concept .The objective to describe the concepts summary based, but to be capable to signify the documents and relations in a combined way for showing document-similarity, docum...
متن کاملEecient Singular Value Decomposition via Improved Document Sampling Eecient Singular Value Decomposition via Improved Document Sampling
Singular value decomposition (SVD) is a general-purpose mathematical analysis tool that has been used in a variety of information-retrieval applications. As the size and complexity of retrieval collections increase, it is crucial for our analysis tools to scale accordingly. To this end, we have studied the application of a new theoretically justiied SVD approximation algorithm to the problem of...
متن کاملClustered SVD strategies in latent semantic indexing q
The text retrieval method using latent semantic indexing (LSI) technique with truncated singular value decomposition (SVD) has been intensively studied in recent years. The SVD reduces the noise contained in the original representation of the term–document matrix and improves the information retrieval accuracy. Recent studies indicate that SVD is mostly useful for small homogeneous data collect...
متن کامل